A System for Extracting and Ranking Name Aliases in Emails

نویسندگان

  • Meijuan Yin
  • Xiaonan Liu
  • Junyong Luo
  • Xiangyang Luo
چکیده

Mining potential information about person identity in emails is one of the popular research topics in email mining. This paper focuses on mining name aliases of a user from emails. Firstly, a system for extracting and ranking name aliases is proposed, which includes two modules: the Alias Extraction Module and the Alias Authority Ranking Module. Secondly, the methods used in the Alias Authority Ranking Module to rank the authority of name aliases of a user are presented in detail, which are based on email communication relation analysis and morphologically similar alias clustering. At last, we evaluate the proposed methods on the public subset of the Enron corpus. Experiment results show that the proposed system can efficiently extract name aliases and find the authoritative aliases of a user.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automatic Detection of Name Disambiguation and Extracting Aliases for the Personal Name

An individual can be referred by multiple name aliases on the web. Extracting aliases of a name is important in information retrieval, sentiment analysis and name disambiguation. We propose a novel approach to find aliases of a given name using automatically extracted lexical pattern based approach. We exploit set of known names and their aliases as training data and extract lexical patterns th...

متن کامل

Automatically Extracting Personal Name Aliases from the Web

An entity can be referred by multiple name aliases on the web. Extracting aliases of an entity is important for various tasks such as identification of relations among entities, automatic metadata extraction and entity disambiguation. To extract relations among entities properly, one must first identify those entities. Aliases of an entity are useful as metadata for that entity and can be used ...

متن کامل

Automatically Extracting Personal Name Aliases from the Web

Extracting aliases of an entity is important for various tasks such as identification of relations among entities, web search and entity disambiguation. To extract relations among entities properly, one must first identify those entities. We propose a novel approach to find aliases of a given name using automatically extracted lexical patterns. We exploit a set of known names and their aliases ...

متن کامل

Identification of Personal Name Aliases on the Web

Extracting aliases of an entity is important for various tasks such as identification of relations among entities, web search and entity disambiguation. To extract relations among entities properly, one must first identify those entities. We propose a novel approach to find aliases of a given name using automatically extracted lexical patterns. We exploit a set of known names and their aliases ...

متن کامل

A Co-occurrence Graph-based Approach for Personal Name Alias Extraction from Anchor Texts

A person may have multiple name aliases on the Web. Identifying aliases of a name is important for various tasks such as information retrieval, sentiment analysis and name disambiguation. We introduce the notion of a word co-occurrence graph to represent the mutual relations between words that appear in anchor texts. Words in anchor texts are represented as nodes in the co-occurrence graph and ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • JSW

دوره 8  شماره 

صفحات  -

تاریخ انتشار 2013